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Abstract. The Minkowski Question Mark function relates the continued-fraction rep- 
resentation of the real numbers to their binary expansion. This function is peculiar in 
many ways; one is that its deiivative is 'singular'. One can show by classical techniques 
that its derivative must vanish on all rationals. Since the Question Mark itself is contin- 
uous, one concludes that the derivative must be non-zero on the irrationals, and is thus a 
discontinuous-everywhere function. This derivative is the subject of this essay. 

Various results are presented here: First, a simple but formal measure-theoretic con- 
struction of the derivative is given, making it clear that it has a very concrete existence as a 
Lebesgue-Stieltjes measure, and thus is safe to manipulate in various familiar ways. Next, 
an exact result is given, expressing the measure as an infinite product of piece-wise contin- 
uous functions, with each piece being a Mobius transform of the form {ax + b)/ [ex + d). 
This construction is then shown to be the Haar measure of a certain transfer operator. A 
general proof is given that any transfer operator can be understood to be nothing more 
nor less than a push-forward on a Banach space; such push-forwards induce an invariant 
measure, the Haar measure, of which the Minkowski measure can serve as a prototypi- 
cal example. Some minor notes pertaining to it's relation to the Gauss-Kuzmin-Wirsing 
operator are made. 



1. Introduction 

The Minkowski Question Mark function was introduced by Hermann Minkowski in 
1904 as a continuous mapping from the quadratic irrationals to the rational numbers 1 23 1. 
This mapping was based on the consideration of Lagrange's criterion, that a continued 
fraction is periodic if and only if it corresponds to a solution of a quadratic equation with 
rational coefficients. Minkowski's presentation is brief, but is based on a deep exploration 
of continued fractions. Minkowski does provide a graph of the function; a modern version 
of this graph is shown in figure[T] 

Arnaud Denjoy gave a detailed study of the function in 1938|6|, giving both a simple 
recursive definition, as well as expressing it as a summation. Its connection to the continued 
fractions allowed him to provide explicit expressions for its self-similarity, described as 
the action of Mobius transformations. Although Denjoy explores a generalization that is 
defined for the non-negative real number line, this essay will stick to Minkowski's original 
definition, defined on the unit interval. Little is lost by sticking to the original form, while 
keeping the exposition simpler. 

For rational numbers, Denjoy's summation is written as 



N 

(1.1) ?(x) =2^2 (-1)''^' 2-("'+"2+-+«*) 

k=l 
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Figure 1.1. The Minkowski Question Mark Function 

Minkowski ? Function 




X 



where the o/. are the integers appearing in the continued fraction lflSl expansion of x: 

X = [ao; 01,02, • • • ,OAf] = oo H j 



For the case of general, irrational x, one simply takes the limit N ^ °°; it is not hard to 
show that the resulting function is well-defined and continuous in the classical sense of 
delta-epsilon limits. 

The function is symmetric in that ? (1 — x) ~ 1 — ? (x), and it has a fractal self-similarity 
which is generated by 1271 

An equivalent, alternative definition of the function can be given by stating that it is a 
correspondence between rationals appearing in the Stern-Brocot, or Farey tree, and the 
binary tree of dyadic rationals, shown in figure [U it maps the Farey tree into the dyadic 
tree. 

Many insights into the nature and structure of the function, including self-similarity 
and transformation properties, can be obtained by contemplating the repercussions of this 
infinite binary-tree representation. Most important is perhaps the insight into topology: the 
presence of the tree indicates a product topology, and guarantees that the Cantor set will 
manifest itself in various ways. 

The Question Mark has many peculiarities, and one is that it's derivative is "singular" 
in an unexpected way|25|. One can show, by classical techniques, that its derivative must 
vanish on all rationals: it is a very very flat function, as it approaches any rational. Since 
the Question Mark itself is continuous, one concludes that the derivative must be non- 
zero, and infinite, on the irrationals, and is thus a discontinuous-everywhere function. The 
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derivative of the Minkowski Question Mark function is interpretable as a measure, and is 
often called the 'Minkowski measure'. It is this measure that is the subject of this essay. 

The derivative has been a subject of recent study. Dushistova, et al. show that the 
derivative vanishes not only on the rationals, but also on a certain class of irrationals |9|; 
this refines a result from Paridis[24|. Kessebohmer computes the Hausdorff dimension of 
the sets on which the deriviative is non-zero lflTl . 

The remainder of this text is structured as follows: the next section introduces the de- 
rivative, or 'Minkowski measure', as the statistical distribution of the Farey numbers over 
the unit interval. The third section reviews the lattice-model or cylinder-set topology for 
binary strings, while the fourth presents an exact result for the Minkowski measure in this 
topology. This results disproves a conjecture by Mayerl22J that the distribution is given by 
the Kac model lfT6l . In the fifth section, the language of lattice models is converted back 
to the standard topology on the real number line, to give an exact result for the measure 
as an infinite product of piece-wise C°° functions, with each piece taking the form of a 
Mobius transform [ax + b) / {cx + d). The sixth section provides a short note on modular- 
ity; the seventh reviews an application, the eighth some generalizations. The nineth section 
introduces the transfer operator, the tenth section provides an abstract theoretical frame- 
work, demonstrating that the transfer operator should be regarded as a pushforward. This 
is found to induce an invariant measure, the Haar measure. The next section provides a 
grounding for the the previous abstract discussions, illustrating each theoretical construc- 
tion with its Minkowski measure analog. The next two sections review a twisted form of 
the Bernoulli operator, and the Gauss-Kuzmin-Wirsing operator, respectively. The papaer 
concludes with a call to develop machinery to better understand the discrete eigenvalue 
spectrum of these operators. 

2. The Minkowski Measure 

It can be shown that the Stern-Brocot or Farey tree enumerates all of the rationals in 
the unit interval exactly once l|26l l5]|. By considering this tree as a statistical experiment, 
as the source of values for a random variable, one obtains a statistical distribution; this 
distribution is exactly the Minkowski measure. 

The statistical experiment may be run as follows: consider the first rows of the tree: 
these provide a pool of 2^ rationals. A random variable Xn is then defined by picking 
randomly from this pool, with equal probability. Ultimately, the goal is to consider the 
random variable X — YimN^ooXN, and understand its distribution. From the measure-theory 
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Figure 2.1. Distribution of Farey Fractions 



Distribution of Farey Numbers 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



p/q 

This figure shows a histogram of the first 2^ — 65536 Farey fractions according to their 

location on the unit interval. To create this figure, the unit interval is divided up into 
M — 6000 bins, and the bin count c„, for each bin m is incremented whenever a fraction 
p/q falls within the bin: [m — \ ) / M < p / q < m/ M. At the end of binning, the count is 
normalized by multiplying each bin count c„ by . The normalization ensures that 

the histogram is of measure one on the unit interval: that Yl^i=QCm = 1- 
The shape of this histogram is dependent on the choice of M and A^. However, it does 
exactly capture the results of performing an experiment on the random variable and 
tallying the results into evenly-spaced bins. The result is 'exact' in the sense of eqn l2.1l 
the height of each bin is precisely given by the differences of the value of ? at its edges, 
up to the granularity imposed by A^. AsN ^ the errors are uniformly bounded and 

vanishing. 

perspective, one is interested in the the probability Pr that a measurement of X will fall in 
the interval [a,b] of the real number line. The distribution is the measure (x) underlying 
this probability: 

rb pb 

Pr[a<X <b]= dn= ^ (x) dx 

J a J a 

It is straightforward to perform this experiment, for finite A^, on a computer, and to graph 
the resulting distribution. This is shown in figure lTTI What is most notable about this figure 
is perhaps that it clearly belongs to a class of functions called 'multi-fractal measures' llT2l 

EH. 

The measure pL is known as the 'Minkowski measure'; its relationship to the Question 
Mark function is straight-forward, and may be stated as a formally: 
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Theorem. The integral of the Stern-Brocot distribution is the Minkowski Question Mark 
function; that is, 

(2.1) l{b)~l{a)^ t dll 

J a 

Proof. The result follows from the duality of the ordinary dyadic tree and the Stern-Brocot 
tree. Consider a different statistical experiment, performed much as before, but where 
this time, one picks fractions from the dyadic tree. Call the resulting random variable Y, 
understood as the limit of discrete variables with N ^ °°. Since the dyadic rationals 
are uniformly distributed on the unit interval, the corresponding measure is trivial: its 
just the uniform distribution. Yet this 'trivial' experiment differs from that giving X only 
by the labels on the tree. Performing the same experiment, in the same way, but just with 
different labellings of the values, is simply a mapping from one set to another; the mapping 
is already given by eqn il.ll it is ? [x). The transformation of measures under the action 
of maps is a basic result from probability theory; the relationship between the random 
variables is simply that Y^ =1{Xn) and Y =1{X). The measure, of course, transforms 
inversely, and so by standard measure theory, one has 

Vx[a<X <b] =?(/?) -?(fl) 

essentially, concluding the proof of the assertion 12. II □ 

Because the equation l2. ll resembles a classical integral, one is sorely tempted to 'abuse 
the notation', and use the classical notation to write: 

(2.2) '^JM^r{x)=^{x) 

dx 

The use of the derivative notation in the above is rather misleading: properly speaking, the 
derivative of the Question Mark is not defined when using the standard topology on the real 
number line. However, the ti'eatment of ^ as a measure can be very rigorously founded. 
Specifically, /i can be seen to be an example of an outer measure; by the measure extension 
theorem (see, for example |20]), an outer measure can be extended to a unique measure on 
the unit interval of the real number line. 

This is done as follows. Let 2 = {0, 1} be the set containing two elements; let 2"' be 
the (semi-)infinite product space 2 x 2 x 2 x • • • . This space consists of all (infinitely-long) 
strings in two letters. An outer measure is a set function v : 2™ ^ [0,°°] that is monotone 
{i.e. V (A) < V (B) whenever A C B C 2'"), that is sigma-subadditive (i.e. given any count- 
able set Ai , A2, . . . of subsets of 2" and A C Ur=o^*:' then v (A) < YX^q V (A^-) ) and that 
V of the empty set is zero. It is straight-forward to show that v =?'o?^^ satisfies each of 
these properties for an outer measure; this essentially follows from the total ordering of the 
fractions on the Stern-Brocot tree (as well as the total ordering of the dyadic rationals on 
the dyadic tree). The remaining step is to notice that the "natural" topology on l"' is the so- 
called "weak topology" (sometimes called the "cylinder set topology"), and that the weak 
topology is finer than the natural topology on the real number line. That is, every subset 
of the unit interval of the real number line is also a subset of 2'*'. The measure extension 
theorem then states that there is a single, unique measure v on the unit interval of the real 
number line that is identical to v on all subsets of the unit interval. 

It is in this way that ?'o?^' should be understood as a measure on the unit interval, 
and, more precisely, a Lebesgue-Stieltjes measure. Since ? is continuous and invertible 
(one-to-one and onto), then ?' is a Lebesgue-Stieltjes measure as well. It is in this way that 
the use of the word "derivative" is not entirely inappropriate: Lebesgue-Stieltjes measures 
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capture the essence of integration, and insofar as the integral is an anti-derivative, one can 
fairly casually talk about derivatives. In particular, later sections below will make use of 
the chain rule for derivatives (? o/)' = (?' o /) • /'; the vaUdity of the chain rule in this 
context works, because it can be seen to be a change-of-variable of a Lebesgue-Stieltjes 
integral. 

The construction of ?'o?~i is such that it is also a measure (an outer measure!) on the 
even finer topology of 2"'. All these various points will be impUcit in what follows, where 
a more casual language will be adopted. 

The product topology is also the setting for the so-called 'lattice models' of statistical 
physics. The next section exphcitly develops the basic vocabulary of lattice models. The 
notion of fractal self-similarity can then be explicitly equated to shifts (translations) of the 
lattice. Equipped with this new vocabulary, it becomes possible to give an exact expression 
for the Minkowski measure fj, to be expressed as an infinite product of Mobius functions. 

3. Lattice MODELS 

This section provides a review of the language of lattice models, setting up a minimal 
vocabulary needed for the remaining development. The use of lattice models is partly sug- 
gested by the developments above: the natural setting for probability theory is the product 
topology, and the natural setting for the product topology is the lattice model. The specific 
applicability of lattice models to the Minkowski measure is made clear in a later section. 

Consider again the set of all possible (infinite length) strings in two letters. Each such 
string can be represented as CJ = (ao, CJi , 02, . . .) where G/^ G { — These strings can 
be trivially equated to binary numbers, by writing the ^'th binary digit as = (a^ + 1 )/2, 
so that for each configuration a, one has the real number 0<x<l given by 

(3.1) x((7) = £^,2-(*+i) 

k=0 

In physics, such strings are the setting for one-dimensional lattice models, the most famous 
of which is the Ising model. One considers each number k to represent a position in space, 
a lattice point. At each lattice point, one imagines that there is an atom, interacting with 
its neighbors, forming a perfect crystal. The goal of physics is to describe the macroscopic 
properties of this crystal given only microscopic details. There are two critical details that 
enter into all such models: the principle of maximum entropy, and translational invariance. 
These two principles, taken together, allow the Minkowski measure to be specified in an 
exact way. 

The principle of maximum entropy states that all possible states of the model should 
be equally likely, subject to obeying any constraints that some or another average value 
or expectation value of a function of the lattice be fixed. There is a unique probability 
distribution that satisfies the principle of maximum entropy, it is known as the Boltzmann 
distribution or Gibbs measure. It is given by 

(3.2) Pr((j) = ^exp(-/3//((7)) 

where Pr (a) denotes the probability of seeing the configuration a. The normalizing factor 
Z (£2) is known as the partition function, it exists to ensure that the sum of all probabiUties 
totals unity. That is, one has 

Z{a)= £ exp(-j3//((7)) 
crsfl 
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where Q. is the set of all possible configurations (7. The function H {a) is a real-valued 
function, conventionally called the 'energy' or 'Hamiltonian' in physics. The principle of 
maximum entropy interprets the energy as something whose expectation value is to be kept 
constant; that is, one adjusts the free parameter /3 in order to obtain a desired value for the 
expectation value 

The second principle commonly demanded in physics is that of translational invariance: 
that, because the crystalline lattice 'looks the same everywhere', all results should be in- 
dependent of any specific position or location in the lattice. In particular, translational 
invariance implies that H (o) should have the same form as one slides around left or right 
on the lattice. In the present case, the lattice is one-sided, and so one can slide in only 
one direction. Defining the shift operator T which acts on strings as t(oo,(Ji,02,...) = 
(cTi , (72, (73, • • •)' the principle of translation invariance states that we should only consider 
functions Hip) obeying the identity 

Unfortunately, for the present model, this cannot quite hold in the naive sense: the lattice 
is not double sided (with strings of symbols extending to infinity in both directions), but 
one-sided. The best one can hope for is to isolate the part that acts just on the first ' atom' in 
the left-most lattice location; call this part V (a), and subtract it from total: thus, instead, 
translational invariance suggests that, for the one-sided lattice, one should write 

(3.3) //(cj) = y((7)+//(Ta) 

That is, the total 'energy' contained in the lattice is equal to the energy contained in the first 
lattice position, plus the energy contained in the remainder of the lattice. That the energy 
should be additive (as opposed to being multiplicative, or combined in some other way) 
is consistent with its use as a log-linear probability in the Gibbs measure; that is, the use 
of the Gibbs measure, together with insistence that the model be translationaUy invariant, 
implies that the energy must be additive. 

From this, it is is straight-forward to decompose H (cr) into a sum: 

(3.4) //((7) = £y(T*(7) 

What remains is the 'potential' V (a), which embodies the specifics of the problem at hand. 
The primary result of this note follows from the application of the above mechanics to the 
Minkowski measure: that is, to assume that the Minkowski measure is a Gibbs measure, 
and to solve for the potential V (cr). 

Before doing so, there is one more relationship to make note of: the shift operator t, 
when looked at from the point of view as a function on real numbers, looks like multiplication- 
by-two. To be precise, one has the identity 



tx{a)- \l^x{a)\ =x(t*cj) 



where \y\ denotes the floor of y. The left-hand side of the above is a central ingredient of 
the construction of the Takagi or 'blancmange' curve; such a curve may be written as 



*;=0 
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for various functions / of the unit interval. Such functions have many curious fractal self- 
similarity properties, and are explored in greater detail in 1281 IZTll . Writing W = / ojc 
shows that the Blancmange curve has the form 

k=0 

when considered on the lattice. The mapping given by eqn l3.1l is not just a mapping be- 
tween strings and real numbers, it is a mapping from the product topology to the natural 
topology of the real number line. Thus, suddenly, many fractal and self-similarity phenom- 
ena are seen to be manifestations of simple structures in the product topology, which are 
then mapped to the natural topology. 

4. Self-Similarity 

Why should one even consider the application of lattice models to the Minkowski mea- 
sure? The core justification is that the shift operator T essentially acts as a multiplication- 
by-two, when considered on binary the numbers x{a) of eqn l3.1l together with the divide- 
by-two self-similarity relation in eqn ll.2l Thus, one posits that there is a probability distri- 
bution 

(4.1) Pr((j) = (?'o?-i)(x((j)) 

that is described by the Gibbs measure. The appearance here of the inverse (x) is so as 
to be able to work 'in the same space' as where one started: that is, (x) is a map that 
takes one from the space of dyadics to the space of Farey fractions, where an operation is 
performed that coincidentally takes one back to the space of dyadics. Seen in this way, eqn 
14. H is essentially the reciprocol of a Jacobian. 

In 1991, Dieter Mayer hvpothesized ll22l that the distribution ^, ll was given by the Kac 
model fT6l, and specifically, by the Kac potential 



(4.2) V{x) 




for < X < 1 /2 
for 1 /2 < X < 1 



while taking P — I. The conjecture appears entirely reasonable, at least at the numerical 
level: inserting this into the Gibbs measure yields a graph that appears to be more-or-less 
equal the Question Mark, up to the level of numerical detail that can be easily achieved. 
However, Mayer does not provide a proof; rather, he intimates that it must be so. 

In fact, numerical efforts suggest that there are many functions V that 'come close', and 
can serve as the basis for similar conjectures. Loosening the restriction of translational 
invariance, to allow Hamiltonians of the form H (a) ~ Lr=o ■^'^^ (T*ff) seems to allow an 
ever richer collection of functions V that seem to yield good numerical approximations to 
the Question Mark. The Kac potential is simply a special case; as originally formulated, it 
is an interaction that decays exponentially with distance. Written in terms of interactions 
between lattice points, this exponential decay takes the form H (a) — YZ=o ^''^(K^k ■ Why 
there should be so many functions that seem to approximate the Question Mark is not 
immediately apparent; perhaps it is a statement that functions that seem 'close to' one 
another with respect to the natural topology of the unit interval have dramatically different 
form when considered in the product topology of the lattice model. 

An exact solution has been promised; it is time to present it. To obtain the solution, one 
'follows one's nose'. Starting with 

Pr(x) = (?'o?-i) (x) 
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one clearly has Pr(?(x)) —7'{x). Applying the symmetry from eqn ll.2l one has that 

The division-by-two is to be recognized as the shift operator. That is, using translational 
symmetry [33] in the Gibbs measure [J!2] vields 

Pr((7) =exp(-y ((7))Pr(TC7) 

or equivalently, 

^(2|))=„p(-v(M))p,p„) 

Back-substituting, one deduces 



On the other hand, one has 

7'{x) 1 



(4.3) 



(l+x)2' Vl 



which may be obtained by naively differentiating the relation 11.21 The casual use of dif- 
ferentiation here deserves some remark; it is justified by the same arguments that allow the 
"abuse of notation" of eqn 12.21 it holds as an identity involving the measure, a change of 
variable taken under the integral sign. 

Continuing, the last result is plugged in to yield 

— exp —V ' 



2 " V V 2 

which is then easily solved for V to obtain 



(4.4) y(y)=log2 + 21og(l-?-i(3;)) forO<3'<^ 

and V{y) ^V{1 ~y) for 1/2 < x < 1. When graphed, this potential does appear to be 
vaguely tent-like, thus perhaps accounting for at least some of the success of the Kac 
potential to model the Question Mark. 

To summarize the nature of this result: the potential 14.41 let us call it the 'Minkowski 
potential', when used to formulate a lattice energv [34l gives the Minkowski measure |4TT] 
as a Gibbs measure [J!2l The next section of this note deals with the form that this measure 
when it is plugged back into this sequence of equations, culminating in the Gibbs measure. 



5. A Product of Piece-wise Functions 

Plugging the exact solution l4~4l into the Hamiltonian l3.4l and this in turn into the Gibbs 
measure !?^ allows the derivative to be written as a product of piece-wise continuous func- 
tions. The result may be written as 

(5.1) ny)-fl^^ 

with the derivation given below. 
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Putting together the Hamiltonian 1 3 .4 1 with the Gibbs measure [l!2l vields 

(5.2) P^(-) = ^nexp(-y(.^a 

To obtain expHcit expressions, it is convenient to perform the change-of-variable ji[:(o') 
? {y) so that eqn l4.1 [ becomes 

(5.3) Pr((j)=?'(y) 
while eqn l4.4l contributes 

I ^ for < y < A 

_ J 2 I-v - --^ - 2 



(5.4) exp-V(f7) =exp-V((x"'o?) (y)) 




for i < y < 1 



The remaining terms of the product need to be evaluated by considering the action of the 
shift operator T. Under the change of variable y —T^^ (x (a)), this is given by 

^xir'^a)) =?-^ f frac (2*^? (y) 



where 'frac' is the fractional part of a number, that is, frac(jic) =x — [x\. The general 
expression follows by iterating on the function 



(5.5) AW=?-'(frac(2?(y))) = <;^;^i 

Define the iterated function A/^ so that 



forO<3'<i 
I 

2 



for ^ < y < 1 



^^+1 iy)^AkoA{y) 

with Ao (y) — y and Ai (y) —A{y). Each of the A^. is made of 2'' Mobius transforms joined 
together; so, for example, the next one is 



^2(3') 



forO<3'<i 
^ for i < X < i 
^ for i < X < I 

for|<x<l 



A graph of the first four is shown in figure |5] 

The structure of the product can be elucidated by relating it to the structure of the dyadic 
monoid l|27l[28l . A monoid is just a group, but without inverses; composing two elements 
of a monoid simply gives another element of the monoid. The dyadic monoid is a free 
monoid generated by two elements g and r, with with — e, and no constraints on g. 
Alternately, the dyadic monoid is the free monoid in two letters L and R; both presentations 
can be seen to correspond to a binary tree. In essence, the dyadic monoid consists of those 
moves that can navigate down an infinite binary tree. 

In this problem, the functions g{y) =y/{l —y) and r{y) — I —y are the generators of 
the dyadic monoid. Explicitly, this is 



1/1+ 1 



(3') = 



{AkOg){y)^Ak(^j^^ fovO<y<^ 
{Akorogor){y)^Ak[^) for i < y < 1 
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Alternately, it can be convenient to work with the left L — g and right R = r o g o r trans- 
formations, with locations in the binary tree identified by navigating to the left or right, 
composing L and R. 

The functions A<.(y) reach their extremal values 0, 1 at exactly the Farey fractions from 
the A:'th row of the Stern-Brocot tree. These may be obtained by following the movement 
of the endpoints 0,1 through successive iterations of L^^ and R^^. These navigate "up" 
the tree; a point of occasional confusion can be avoided by keeping in mind that taking the 
inverse reverses the order of symbols, i.e. so that (/ o /i) ^ ^ — o For example, the 
third row of the Stern-Brocot tree is given by ^ = L^' oL^' oL^^ Q), j — L^^ oL^^ o 
l=L-^oR-^-oL-^i), l=L-'o/^-i'o7^-'(i), l=/?-'oL-''oL-'(i), 
and so on. Note that the order seems "reversed" from the natural order in which one might 
be tempted to apply the transformations L and R. 

Of particular importance is that exponential of the potential, eqn 15.41 is just half the 
derivative of A; 

exp-(yo^-io?) (y)^^ 

This is perhaps the most remarkable result of this development. That the Gibbs measure 
can be written as infinite product is no surprise. That repeated applications of the shift 
operator t can be represented as the iteration of a function comes as no surprise either. That 
A is the proper form for the shift operator just follows from a study of the self-similarity 
properties of the Question Mark. However, the expression for the potential V, as written 
out in eqn 14.41 seems opaque, and without particular significance. Here, the potential is 
revealed to be "not just some arbitrary function", but related to the shift itself, and, more 
deeply, to the inverse of the transfer operator for the system. This is examined in greater 
detail below. 

Several final remarks are in order. First, one has that the partition function Z (fi) = 1 . 
This is no accident, but was guaranteed by construction. Forcing the relationship 14.11 
together with the fact that ?(1) = 1, ensured that all normalizing factors that might have 
appeared in Z(i2) were in fact absorbed as an additive constant by V. Explicitly, this 
scaling leads to the constant log 2 in eqn 14.41 

Caution should be used when evaluating eqn |5T| numerically. If preparing a graph, or 
numerically evaluating its integral, the product is best terminated after < log2 h itera- 
tions, where h is the step size for the numerical integral. Using larger than this bound 
will result in a very noisy sampling, resulting in numerical values that no longer resemble 
those of ?'. 



6. An Application 

Perhaps the most immediate utility of the expression 15. II is to provide insight into the 
question of "for what values of x is ?' (x) zero, and for what values is it infinite?"; such a 
question is explored in ||9] |24|. So, for example: the A^-'s can be seen to peel off binary 
digits, since A/^ {y) =?^' (frac (2*^? (y)))- Thus, for any rational p/q, one has that ? {p/q) 
has a finite length when expressed as a string of binary digits, and so one immediately has 
thatAii{p/q) —Owhenk> N: the infinite product terminates, and so of course, l{p/q) =0. 

More interesting is the case of y a quadratic irrational. In this case, ? [y) is a rational 
number, and Af, {y), after a finite number M iterations, settles into a periodic orbit, of length 
N — 2^ (of course: this is a basic result from the theory of Pellian equations). In this case. 
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eqn IS.ll mav be written as 




2 



Clearly, the periodic part dominates. Writing 



2 



Piy) = Y{ 



as the contribution from the periodic part, one clearly has that ?' (y) = whenever P (y) < 
1, and ?'(3') = °° when P{y) > 1. The accomplishment of |9| is to provide a precise 
description of these two cases, in terms of the continued fraction expansion of y. Thier 
work would appear to rule out any cases where P{y) = 1. 



The self-similarity transformation of the Minkowski measure resembles that of a modu- 
lar formH . That is, given a general Mobius transform of the form {ax + b)/ {cx + d), with 
integers a,b,c,d and ad — be ^ 1, the self-similarity of the Question Mark takes the form 



for some integers M,N,Q,K, depending on the specific values of a,b,c,d. Differentiating 
the above, one readily obtains 



which is almost exactly the transformation rule for a modular form of weight 2, and is 
spoiled only by the factor of 2^. This suggests a conjecture: that there may be some 
holomorphic function, defined on the upper half -plane, for which the Minkowski measure 
is the limit of that function, the limit being taken a it's restriction to the real axis. Work 
in this direction has been given by AUcauskas, who notes that the generating function for 
the moments of the Question Mark is holomorphic in the upper half -plane [2 1, and gives 
Question Mark analogs for Maas wave forms, zeta functions and similar number-theoretic 
constructs [1]. 

This conjecture is best illustrated numerically. Take the product lSTl terminating it after 
a finite number of terms; then graph the reciprocal. The resulting graph fairly resembles 
what one would obtain, if, for example, one graphed the Dirichlet eta function along a 
horizontal line, not far above the real axis. The Dirichlet eta is a prototypical example 
of a modular form; that it is also commonly written as an infinite product is also highly 
suggestive. These similarities bear further elucidation. 



Several generalizations of the Question Mark are possible. To properly define these, it 
is imperative to regain a clearer view of the lattice model methods. The ingredients to the 
above construction made use of a lattice i2, whose states are encoded by a, a lattice shift 
operator T, and a function V on the lattice, to define a probability: 



7. Modularity 





8. Generalized Shifts 
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A second important ingredient was to provide an explicit identification of lattice arrange- 
ments (7 with real numbers y in the unit interval, given by a = x^' (? (y)). Putting these 
together lead to the form 

(8.1) ^/W-77^n/°^^W 

^ l^'^J A:=0 

The appearance of A^t is explicitly governed by the mapping a = x^^ {1 (y)); that is, Ai. is a 
representation of the shift operator z'^, for this particular mapping of a to real values. The 
function / is simply exp — V, that is, 

f{y)=exp-V{x-'{7{y))) 

The partition function Z (H) is re-introduced as a reminder that the product definition only 
makes sense when / has been scaled appropriately; that is, / is arbitrary only up to a 
total scale factor; / exists only as an element of a projective space. The constraint is even 
stronger: inappropriately normalized / lead to the formal divergence Z (H) = there is 
only one normalization for which Z (H) is finite, and that furthermore, by construction, one 
then has Z(i2) = 1. 

Two directions along which to generalize are now clear: one may consider general 
functions V, and one may consider different mappings from the lattice space £2 to the unit 
interval (which is compact, or to the positive real number line, which is not). The former 
is reviewed in a later section; lets treat the later case first. 

The most widely used mapping is usually the much simpler one: x = x{a) from eqn 
13.11 instead of the "twisted" mapping a = x^' (? (y)). The simpler mapping is commonly 
used to study the Bernoulli shift||2l[l0l[T4 |, an entry-point into the study of exactly solvable 
models in symbolic dynamics. By contrast, the "twisted" mapping allowed the Question 
mark to be properly formulated, thus suggesting that other maps can lead to other interest- 
ing objects. A twisted version of the Bernoulli Map will be presented in a later section. 

The product form 15.1 [ provides the most concrete avenue immediately for generaliza- 
tions. One need only to consider any differentiable, piece-wise function A and use it in 
the construction. For example, one may consider a 3-adic generalization of the Question 
Mark, which may be constructed from 

for < y < i 

(8.2) B{y)^{3y-1 foi-^<y<^ 




3 3 

^ for I < y < 1 

. y 3 —-^ — 

and using B instead of A to generate the iterated function in the product, and B' /3 in place 
of A'/2 as the initial term. The resulting function, let's call it 7'^, can be integrated to define 
a 3-adic Question Mark ?3, which is shown in figure [8] Note that it requires no further 
normalization; one has ?3(1) = 1 simply by having divided B' by the number of branches 
of B. Visually, it is considerably less dramatic than the 2-adic Question Mark; hints of its 
self-similarity are visible but less apparent. 

By construction, it has self-similarity, which can be immediately read off by considering 
so, for example: 

>{x) 



and 

1 7(x) 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 
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and 




These self-similarity properties allow an explicit equation for ?3 to be written out, entirely 
analogous to Denjoy's explicit summation given in eqn il.ll although now one must con- 
sider even and odd values of the terms of the continued fraction distinctly. Whether, or 
how, the 3-adic, or, more generally, a p-adic Question Mark, inter-relate to results from Pel- 
lian equations, is not clear Of some curiosity is whether the feature that caused Minkowski 
to present the Question Mark, namely, Lagrange's result relating periodic continued frac- 
tions to quadratic irrationals, might somehow be reflected, e.g. with cubic irrationals, and 
an appropriate 3-adic redefinition of periodicity in the continued fraction. 

9. The Transfer Operator 

The mapping from the lattice to the unit interval results in a collision of effects coming 
from different topologies: the product topology of the lattice, and the natural topology of 
the real number line. A standard way of studying this collision is by means of the transfer 
operator. The goal of this section is to introduce the transfer operator; the next section will 
show, in general terms, that the transfer operator is the push-forward of the shift operator 

Transfer operators, also commonly called Perron-Frobenius operators or Ruelle-Frobenius- 
Perron operators, as formulated by David Ruelle[xxx need ref] are a commonly-used tool 
for studying dynamical systems|7|. As the name "Perron-Frobenius" suggests, these are 
bounded operators, whose largest eigenvalue is equal to one; the existence of this eigen- 
value is commonly justified by appealing to the Frobenius theorem, which states that 
bounded operators have a maximal eigenvalue. The origin of the name "transfer" comes 
from lattice-model physics, where the "transfer interaction" or the "transfer matrix" is 
studied ifTSl [TTl [T9]| . In this context, the "transfer interaction" describes the effect of neigh- 
boring lattice sites on one-another; it can be seen to be a matrix when the number of 
interacting lattice sites, and the number states a single lattice point can be in, is finite. It 
becomes an operator when the interaction is infinite range, or when the set of states at a 
lattice site is no longer finite. 

It is easiest to begin with a concrete definition. Consider a function g : [0, 1] — > [0, 1], 
that is, a function mapping the unit interval of the real number line to itself. Upon iteration, 
the function may have fixed points or orbits of points. These orbits may be attractors or 
repellors, or may be neutral saddle points. The action of g may be ergodic or chaotic, 
strong-mixing or merely topologically mixing. In any case, the language used to discuss 
g is inherently based on either the point-set topology of the unit interval, or the "natural" 
topology on the unit interval, the topology of open sets. 

A shift in perspective may be gained not by considering how g acts on points or open 
sets, but instead by considering how g acts on distributions on the unit interval. Intuitively, 
one might consider a dusting of points on the unit interval, with the local density given by 
p (x) at point x E [0,1], and then consider how this dusting or density evolves upon iteration 
by g. This verbal description may be given form as 

(9.1) p'{y)^ [' 5{y-g{x))p{x)dx 

Jo 

where p'{y) is the new density at point y = g{x) and 8 is the Dirac delta function. 

In this viewpoint, g becomes an operator that maps densities p to other densities p', or 
notationally, 

^,p=p' 
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The operator is the transfer operator or the Ruelle-Frobenius-Perron operator. It is not 
hard to see that it is a hnear operator, in that 

^g{ap\ +bp2) = a^gPi + b^gP2 

for constants a,b and densities pi,p2- 

When the function g is differentiable, and doesn't have a vanishing derivative, the inte- 
gral formulation of the transfer operator |9T| can be rephrased in a more convenient form, 
as 



(9.2) [^sp]iy) 



where the sum is presumed to extend over at most a countable number of points. That is, 
when g is many-to-one, the sum is over the pre-images of the point y. 

Both of the above definitions are mired in a language that implicitly assumes the real- 
number line, and its natural topology. Yet clearly, this is of limited utility; extracting the 
topology from the definition provides a clearer picture of the object itself. 

10. The Push-Forward 

In this section, it will be shown that the transfer operator is the push-forward of the shift 
operator; a theorem and a sequence of lemmas will be posed, that hold in general form. 
The point of this theorem is to disentangle the role of topology, and specifically, the role of 
measure theory, from the use of the shift operator We begin with a general setting. 

Consider a topological space X, and a field F over the reals R. Here, F may be taken to 
be R itself, or C or some more general field over R. The restriction of F to being a field 
over the reals is is required, so that it can be used in conjunction with a measure; measures 
are always real-valued. 

One may then define the algebra of functions ^{X) on X as the set of functions 
/ G ^{^) such that f : X ^ F. An algebra is a vector space endowed with multipli- 
cation between vectors. The space ^{X) is a vector space, in that given two functions 
/i)/2 G -^{X)^ their linear combination af\ +bf2 is also an element of J^{X); thus /i 
and /2 may be interpreted to be the vectors of a vector space. Multiplication is the point- 
wise multiplication of function values; that is, the product /1/2 is defined as the function 
(/i/2)W = fiix) ■ fiix), and so /1/2 is again an element of ^{X). Since one clearly has 
/1/2 = /2/1, multiplication is commutative, and so ^{X) is also a commutative ring. 

The space may be endowed with a topology. The coarsest topology on J^{X) is 

the weak topology, which is obtained by taking ^{X) to be the space that is topological 
dual to X. As a vector space, ^{X) may be endowed with a norm |j/||. For example, one 
may take the norm to be the L^-norm 

Il/llp =(/ imfdxy' 

For p = 2, this norm converts the space J^{X) into the Hilbert space of square-integrable 
functions on X. Other norms are possible, in which case ^{X) has the structure of a 
Banach space rather than a Hilbert space. 

Consider now a homomorphism of topological spaces g :X —^Y. This homomorphism 
induces the puUback g* : ^{Y) — > ■^{X) on the algebra of functions, by mapping / 
8* if) = f °8 so that f og:Y F . The pullback is a linear operator, in that 

g*{an+bf2)^ag*{n)+bg*{f2) 
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That the pullback is linear is easily demonstrated by considering how g* f acts at a point: 
{g*f) (x) = {fog) (x) = f{g{x) ) and so the linearity of g* on afi + bf2 follows trivially. 

One may construct an analogous mapping, but going in the opposite direction, called 
the push-forward: g^, : ,^{X) ^{Y). There are two ways of defining a push-forward. 
One way is to define it in terms of the sheaves of functions on subsets of X and Y. The 
sheaf-theoretic description is more or less insensitive to the ideas of measurability, whereas 
this is important to the definition of the transfer operator, as witnessed by the appearance of 
the Jacobian determinant in equation |9.2| By contrast, the measure-theoretic push-forward 
captures this desirable aspect. It may be defined as follows. 

One endows the spaces X and Y with sigma-algebras {X,S!/) and {Y,£§), so that £/ 
is the set of subsets of X obeying the axioms of a sigma-algebra, and similarly for 
A mapping g : X ^ Y is called "measurable" if, for all Borel sets B G one has the 
pre-image g^^{B) G si/ being a Borel set as well. Thus, a measurable mapping induces a 
push-forward on the sigma-algebras: that is, one has a push-forward : ^(s/) ,^{!^) 
given by / ^ g*if) — f °8 \ which is defined by virtue of the measurability of g. The 
push-forward is a Unear operator, in that 

g.ian+bf2)=ag4fi)+bg4f2) 

One regains the transfer operator as defined in equation l9.2l bv considering the limiting 
behavior of the push-forward on progressively smaller sets. That is, one has 

Theorem 10.1. The transfer operator is the point-set topology limit of the measure-theoretic 
push-forward. 

Proof. The proof that follows is informal, so as to keep it simple. It is aimed mostly at 
articulating the language and terminology of measure theory. The result is none-the-less 
rigorous, if taken within the confines of the definitions presented. 

Introduce a measure ji : si/ ^ and analogously v : 3§ ^ M+. The mapping g is 
measure-preserving if v is a push-forward of jj., that is, if v = g^:jX — jJ-og^^. The measure 
is used to rigorously define integration on X and Y. Elements of ."^{si/) can be informally 
understood to be integrals, in that /(A) for A G s^/ may be understood as 



/(A)= ( f{z)dii{z)^ j f{z)\li\z)\dz 

J A J A 



where is to be understood as the Jacobean determinant at a point x G X. Here, / 

can be understood to be a function that is being integrated over the set A, whose integral is 
denoted by /(A). The value of / at a point x G X can be obtained by means of a limit. One 
considers a sequence of A G s/, each successively smaller than the last, each containing 
the point x. One then has 

which can be intuitively proved by considering A so small that / is approximately constant 
over A: 

f{A)= [ f{z)dfi{z)^f{x) [ d^i=f{x)fi{A) 

J A J A 

To perform the analogous limit for the push-forward, one must consider a point y G F 
and sets B G ^ containing y. In what follows, it is now assumed that g : X ^ Y is a. 
multi-sheeted countable covering of Y by X. By this it is meant that for any y that is not 
a branch-point, there is a nice neighborhood of y such that its pre-image consists of the 
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union of an at most countable number of pair- wise disjoint sets. That is, for y not a branch 
point, and for B 9 y sufficiently small, one may write 



g-\B)=AxVJA2VJ- 



7=1 



where A: is either finite or stands for oo, and where A, n Ay = for all j. At branch points, 
such a decomposition may not be possible. The axiom of sigma-additivity guarantees that 
such multi-sheeted covers behave just the way one expects integrals to behave: in other 
words, one has 

^{g-\B))=^I^A)j=t^{Aj) 

whenever the collection of A , are pair- wise disjoint. Similarly, in order to have the elements 
/ e ^{^) behave as one expects integrals to behave, one must restrict ^{s^/) to contain 
only sigma-additive functions as well, so that 



/(g-i(B))=/(^UA,J=£/(A,) 

As the set B is taken to be smaller and smaller, the sets Aj will become smaller as well. 
Denote by Xj the corresponding limit point of each Aj, so that g{xj) = y and the pre-image 
of y consists of these points: g^^{y) = {xi ,X2, • • • | g{xj) = J } • One now combines these 
provisions to write 



[g*f\ {y) = im 

Bay 

= lim 



(g*/)(g) 
V(B) 

{fo8-')iB) 



Um 



Aj3g-Hy) 



= lim 



v(s) 

/(AiUAzU---) 

v{B) 



(10.1) 



k 



v(B) 



v{B) 



The limit in the last line of this sequence of manipulations may be interpreted in two ways, 
depending on whether one wants to define the measure V on 7 to be the push-forward of 
H, or not. If one does take it to be the push-forward, so that V = g*n, then one has 



lim 



1 



Aj^jSMB) \g'{xj)\ 

where |^'(xy)| is the Jacobian determinant of g at Xj. This last is a standard result of 
measure theory, and can be intuitively proved by noting that g {Aj) = B, so that 

V(B)= / g'{z)dn{z)^g'{xj)n{Aj) 
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for "small enough" B. Assembling this with the previous result, one has 

,/(^;) 

.v,ei'-'(v) 



which may be easily recognized as equation |9.2| This concludes the proof of the theorem, 
that the transfer operator is just the point-set topology limit of the push-forward. □ 

In simplistic terms, the push-forward can be thought of as a kind of change-of-variable. 
Thus, one should not be surprised by the following lemma, which should be recognizable 
as the Jacobian, from basic calculus. 



Lemma 10.2. (Jacobian) One has 



Proof. This follows by taking the limit Aj 3 xj of 



and then summing over j. □ 

Corollary 10.3. The uniform distribution fi.e. the measure) is an eigenvector of the trans- 
fer operator, associated with the eigenvalue one. 

Proof. This may be proved in two ways. From the viewpoint of point-sets, one simply 
takes / = const, in equation |l0.2| and applies the lemma above. From the viewpoint of the 
sigma-algebra, this is nothing more than a rephrasing of the starting point, that v = 
and then taking the space Y = X, so that the push-forward induced by g : X X is a 
measure-preserving map: g^^pL — pL. □ 

The last corollary is more enlightening when it is turned on its side; it implies two 
well-known theorems, which follow easily in this framework. 

Corollary 10.4. (Ruelle-Perron-Frobenius theorem). All transfer operators are continu- 
ous, compact, bounded operators; furthermore, they are isometries ofBanach spaces. 

Proof. This theorem is of course just the Frobenius-Perron theorem, recast in the context 
of measure theory. By definition, the measures have unit norm: that is, = 1 and 

II V II 1 = 1. This is nothing more than the statement that the spaces X and Y are measurable: 
the total volume of X and Y is, by definition, one. Since v = we have ||^*/x||j — 1, 
and this holds for all possible measures pL ^ ,^{X). 

Recall the definition of a bounded operator. Given a linear map T : U ^ V between 
Banach spaces C/and V, then T is bounded if there exists a constant C < °° such that 
||7"m|Iv < C II m II f/ for all u E U. But this is exactly the case above, with T = g^,, and U = 
^{X), V — ^{Y), and C = 1. The norm of a bounded operator is conventionally defined 
as 

||r||=sup|^= sup ||r«||^. 

!(/0 ll«llt/ ||w||(/<l 

and so we have the norm of being ||g*|| = 1. That is an isometry follows trivially 
from ll^t^lli = Iklii and that is linear. □ 
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The corollai'v ll0.3l can also be treated as a corollary to the Perron-Frobenius theorem: 
namely, that there is at least one vector that corresponds to the maximum eigenvalue of g*. 
This eigenvector is in fact the Haar measure, as the next theorem shows. 

Theorem 10.5. (Haar measure) For any homomorphism g :X —> X, one may find a mea- 
sure H such that = ll; that is, every homomorphism g ofX induces a measure jl onX 
such that g is a measure-preserving map. If g is ergodic, then the measure is unique. 

Proof. By definition, /i is a fixed point of g*. The fixed point exists because g+is a bounded 
operator, and the space of measures is compact, and so a bounded operator on a compact 
space will have a fixed point. The existence of the fixed point is given by the Markov- 
Kakutani theorem||8] p 456]. The Markov-Kakutani theorem also provides the uniqueness 
condition: if there are other push-forwards /i* that commute with then each such push- 
forward will also have a fixed point. The goal is then to show that when g is ergodic, 
there are no other functions h that commute with g. But this follows from the definition 
of ergodicity: when g is ergodic, there are no invariant subspaces, and the orbit of g is 
the whole space. As there are no invariant subspaces, there are no operators that can map 
between these subspaces, i.e. there are no other commuting operators. □ 

A peculiar special case is worth mentioning: if g is not ergodic on the whole space, 
then typically one has that the orbit of g splits or foliates the measure space into a bunch of 
pairwise disjoint leaves, with g being ergodic on each leaf. The Markov-Kakutani theorem 
then impUes that there is a distinct fixed point /i in each leaf, and that there is a mapping 
that takes jU in one leaf to that in another 

In the language of dynamical systems, the push-forward is commonly written as 
so that one has 

g*=^s- ■^[X)^.^{X) 
now being called the transfer operator or the Ruelle-Frobenius-Perron operator. 

In the language of physics, the fixed point /i is called the "ground state" of a system. 
When it is unique, then the ground state is not degenerate; when it is not unique, then the 
ground state is said to be degenerate. The operator is the time-evolution operator of the 
system; it shows how physical fields / G ■'^{X) over a space X evolve over time. When F 
is the complex numbers C, the fact that \\g^ || = 1 is essentially a way of stating that time- 
evolution is unitary; the Frobenius-Perron operator is the unitary time-evolution operator 
of the system. What is called "second quantization" in physics should be interpreted as the 
fitting of the space ."^{X) with a set of basis vectors, together with a formulation of in 
terms of that basis. 

11. Grounding 

To make sense of the discussion above, it is best to ground the symbols in concrete 
terms. For the most part, this paper is concerned either with the space X — Q., the space 
of strings in two letters, or the spaces X that are somehow isomorphic to fi. This space 
is a (semi-)infinite product of Z2 = {0,1}; elements a G £2 of this space were values 
a = ((7o, (Ti , • • • ) with each Ok&l'i- The function g is to be equated with the shift operator 
t; that is, when X — £1, then g = T and g :X ^ X \s just T : H ^ £2 acting such that 
T(ao, (7i , (72i • ■ ■ ) ~ (Ci ■ 02, • ■ ■ )■ Curiously, the structure of the space £2 can be understood 
to be a binary tree: starting at Oq one can branch to the left or to the right, to get to a\, and 
so on. 

However, a product space such as this also allows the language of probability theory 
come into play, and this is where things get interesting. Product spaces are the natural 
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setting for probability: the element (7 e i2 is called a "random variable", and successive 
values (7^. of a are "measurements" of the random variable. This is also the manner in 
which measure theory gets its foot in the door 

The mappings x : i2 R given by eqn l3.1l is now seen to be an example of an element 
of the algebra ^ (i2) introduced in the previous section. The mapping y : i2 ^ R given by 
a y [x [a)) is another example of an element of ^ Both the maps x and y 
are invertible on the closed unit interval, establishing a 1 — 1 mapping between i2 and the 
closed unit interval. Consider, then, the function 

b — xo Tox^^ 

which maps the unit interval into itself. Let w G [0, 1] be a point in the unit interval; then it 
is not hard to see that b{w) — 2w modi. In the literature, this function is commonly called 
the Bernoulli shift; its corresponding random process is known as the Bernoulli process. It 
is well-known to be ergodic. The other function that was considered here was 

A = y o Toy^' 

Its action on the unit interval has already been given by eqn 15.51 In a previous section, 
we have already considered generalizations of A, that is, other functions B : [0, 1] [0, 1], 
such as, for example, eqn l8.2l 

Each of these maps b. A, B can be taken as an example of a map g : X ^ X, now 
taking X to the unit interval. By the previous theorem, each such map g induces a push- 
forward = ^ij, and this push-forward then induces an invariant measure fig on X. For 
the Bernoulli map b, the push-forward Ji'i, is introduced in fllT.TTl, and a detailed study of 
its eigenvectors is given in [7J. The invariant measure jij, is well-known to be the identity 
on the unit interval. This just means that b considered as an ergodic process, is uniformly 
distributed on the unit interval. 

The invariant measure /Xa has already been established in eqn l5.3l as being the Minkowski 
measure; that is, jiA =?'. A previous section already gave specific mechanics for how to 
take a general, differentiable onto-map B and define its associated measure fig as the gen- 
eralization of eqn l5.1l The next section introduces the transfer operator J^a corresponding 
to A. 

12. Flightiness 

To counterbalance the grounding, a brief bit of flightiness as to this process is offered up. 
The map g :X ^X, can be understood to give the (discrete) time-evolution of a dynamical 
system. It is curious to ponder that iterating on g can be thought of as the (semi-)infinite 
chain 

8 8 8 8 

X — > X — > X — > X — 

and so any particular g, together with an initial state jcq, can be interpreted as selecting a 
particular sequence (xo,xi,X2, . . .) out of the (semi-)infinite product space Q. = X xX xX x 
■ ■ ■ . The set membership function now defines a measure on Q.: sets that contain sequences 
{x(),xi,X2, ■■■) for which x^+i = g {xk) will have a non-zero measure, all others will have a 
measure of zero. 

This now begs the question: what sort of operators might have this measure as their 
invariant measure? Which of these operators might be push-forwards? What would they be 
a push-forward of? Last but not least, all of the spaces under discussion are metric spaces, 
and so it would be curious to know "what else is close by". These questions are not entirely 
idle. The product space £2 can be seen to be "space-time"; by analogy to the binary tree, the 
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Structure of this space-time is once-again seen to be branching; infinitely branching in this 
case. But such branching is already seen in physics: this is nothing other than the many- 
worlds interpretation of quantum mechanics, where, at each moment of time, the universe 
is seen to split into an "infinite number of copies of itself". The many-worlds branching 
structure is not well-explored or well-understood, although it can illuminate problems of 
quantum measurement and wave-function collapse. One such curious problem is the so- 
called "Mott problem". The question that is posed by the Mott problem is "why does the 
spherically symmetric wave function of an emitted alpha-ray always leave a straight-line 
track in a Wilson cloud chamber?" The answer was given by Heisenberg and Mott in 1929; 
it can be re-interpreted in the context of the current formulation as a statement that straight- 
line tracks have a significant measure in this branching space-time, as they all correspond 
to sets that contain the point that is the spherical wave-function, and no other sets do. But 
this is just an impressionistic daydream, lacking rigor; the hope is that it motivates further 
study. 

13. The Twisted Bernoulli Operator 

We know consider the transfer operator or push-forward of the mapping A. It resembles 
the Bernoulli shift, but is twisted in a curious way; it is a continued-fraction analog of the 
Bernoulli shift. It is straight-forward to write it down; it is very simply 

i-^Af]iy)^—^f(j^)+—^f( ^ 



{l+yf \^+yJ {2-yf- \2~y, 
This expression is easily obtained by starting with eqn l9.2l and substituting in A from eqn 



Of course, the Minkowski measure is an eigenfunction of the twisted Bernoulli shift; 
that is, one has 

^A?' =?' 

This can be seen in several ways: The Minkowski measure was constructed as the Haar 
measure induced by A, and so by theorem FlO.SI of course this identity must hold. But it 
can also be validated by more pedestrian methods: simply plug in the measure, and turn 
the crank, making use of the self-similarity relation |43] as needed. 

Other eigenvalues of ^ are readily found. Consider the functions Pf as previously 
defined in l8.ll The action of the twisted Bernoulli shift on Pf is readily computed, and is 
given by 

We've already observed that when / — A'/2, that one has 

so that Pj^i 12 =?'. Another, linearly independent function which satisfies ^/ = 1 is 

U fori<,<l 

that is, = 1 . The collection of all / which satisfy ^/ = 1 is large, but is easily 
specified. Let 



fiy) 




for < y < ^ 
for i < y < 1 
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then, given any, arbitrary /o, solving for J^A.f = 1 results in 

/i (y) ^ - -- — -t/o 



Such / are properly scaled; that is, Z(n) — 1. Because of the projective relationship 
between the functions / and the normalization Z(i2), it appears that this technique will 
result in only in eigenfunctions Pf with eigenvalue 1 . 

Keep in mind that while Pf+h 7^ Pf+Ph, such a relationship does hold in the logarithmic 
sense: Pfh = PfPh, thus P can be construed to be a (logarithmic) homomorphism of Banach 
spaces. 

14. The Gauss-Kuzmin-Wirsing Operator 

The Gauss-Kuzmin-Wirsing (GKW) operator is the transfer operator of the Gauss map 
h{x) — 1 /x — [1 /-xj • Denoting this operator as it acts on a function / as 

(14.1) [^G/]w=i:-^/(^ 

„=l [x + n) \x + 

It has a well-known eigenfunction, noted by Gauss, corresponding to eigenvalue one, given 
by 1/(1 It is also relatively straightforward to observe that the Minkowski measure 
is also an eigenfunction, also corresponding to eigenvalue one. This is readily seen by 
differentiating the Question Mark: one has 

+ xj 2"-i 2" 
and so also 

1 \ _ {x + nf 



^n+xj 2" ^ ' 

This can be directly inserted into eqn |14.1| to very easily find that 

^G?' =?' 

This result immediately suggests that the lattice model methods can be applied to find other 
eigenfunctions. (The 3-adic Question mark, at least as defined above, does not have this 
property, suggesting, perhaps, that a different 3-adic Question Mark should be formulated). 

The GKW operator is tied by various identities to the twisted Bernoulli operator ^a- 
These are explicitly given by |15|; they can be crudely summarized by saying that these 
identities are the operator analogues of Denjoy's explicit summation ll.il In these rough 
terms, the Bernoulli shift chops off one digit of a binary number at a time; these may be in 
turn re-assembled to form the GKW operator. 

The action of the GKW operator on Pf is not at all tidy; the result that ^q]' =?' 
appears to be entirely due to the special form of / ^A! 12. This is perhaps no surprise. 

Other fractal eigenvectors for the GKW can be constructed by a technique entirely par- 
allel to the construction of the Pf, one only has to use a different lattice model. For this 
model, construct 

where /ijt+ 1 = /i^ ohv& function iteration, and h (y) — frac ( 1 /y) is the Gauss map, as before. 
This multi-branched h has the property that 

1 
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for all integer values of w > 1. Applying the GKW operator to Rf results in 

in analogy to the twisted Bernoulli operator. As before, functions / for which = 1 also 
implies that Z (fi) = 1 . The means for finding solutions to ^cf = 1 proceeds analogously 
to the twisted-Bernoulli case. Thus, let 

fiiy) fori<3;<l 

fn{y) forj^<y<l 

The equation ^cf — 1 establishes only a single constraint between these /,,'s: all but one 
may be freely chosen. As an example, consider 

Then the resulting / satisfies J^'cf = 1 provided that the constants satisfy 

oo 

On = 1 

n=l 

By taking «„ = 2~", one regains the Question Mark function; that is, Rf =?' for this case. 
Quick numerical work shows that other geometric series return commonly-seen "Question- 
Mark-like" distributions, while arithmetic series, such as fl„ = 6/ {nnf', yeild more unusual 
shapes. 

This construction shows that the space of fractal eigenfunctions of the GKW operator 
is very large; but, as before, all of these eigenfunctions appear to be non-decaying, corre- 
sponding to eigenvalue 1 . 

15. Conclusion 

The principal results of this note are the statistical distribution of equation |2.U which 
allows numerical explorations of the binary trees to be explicitly equated to the Minkowski 
measure, and equation [5.1| which shows that the seemingly intractable Minkowski measure 
can be built up out of analytic functions in a regular way. A sufficient amount of formal 
theory is developed to show how to root this measure in a foundational apparatus, thus 
enabling various basic manipulations on it to be safely performed. Of these, the most 
notable was to expose the transfer operator as a push-forward on a Banach space, leading 
to an interpretation of the Minkowski measure as a invariant measure, a Haar measure, 
induced by a dynamical map. 

An important theme is the tension between the natural topology on the real number line, 
and the much finer 'weak topology' on the product space 2*". It should be clear that the 
fine topology is the source for a broad variety for fractal, self-similar phenomena. The use 
of the measure extension theorem allows these fine-grained fractal phenomena to manifest 
themselves onto the natural topology; the Minkowski measure serves as a non-trivial yet 
still simple example of such a manifestation. 

However, various open problems remain. There have now been many generalizations of 
the Question Mark proposed; besides that of Denjoy, and those of this note, some additional 
ones are given in 1 3 1 . How are all of these related? How are they best enumerated? Is there 
some characteristic labelling that best distinguishes each form? Insofar as these variants do 
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occasionally manifest in practical problems, it is useful to have a coherent naming system 
to describe them. 

It is noted in section |4] that there seem to be many different functions V that generate 
functions Q that seem close to the Question Mark; in some cases, these appear to be arbi- 
trarily close, when studied numerically. Why is this? Is there an actual degeneracy, with 
different V 's generating the same function Ql If not, then the metric on the Banach space 
of different V's must be wildly folded to yeild close-by Q's: the metrics are not "compat- 
ible", or presumably discontinuous in some way. The structure of the mapping from the 
space of y 's to the space of Q's is opaque, and would benefit from clarification. 

Section[8]considered a 3-adic form of the Question Mark. Its construction is somewhat 
ad hoc, and it's properties are less than pretty: for example, it is not an eigenfunction 
of the GKW operator Is there another construction that would be an eigenfunction? If 
not, why not? If so, what would the correct p-adic generalization be? Let's assume that 
unambiguous p-adic generahzations can be written. What are thier connections to number 
theory? That is, ? (x) maps rationals to quadratic surds; is there an analogous statement for 
such ?p {x) ? 

The most important open problem surrounds the observation that transfer operators have 
a discrete spectrum, when the space of functions is restricted to a much smaller set, for 
instance, when it is restricted to the set of square-integrable functions, or to the set of poly- 
nomials. In some cases, this spectrum can be obtained exactly; this has been done for the 
case of the Bernoulli operator ifTOl [T4l ITI . More generally, obtaining this spectrum seems 
almost intractable; here, the prototypical example is the GKW operator Of course, both 
the eigenvalues and the eigenvectors of the GKW operator can be computed numerically; 
however, there is no articulation or framework within which to talk about them and to 
manipulate them; they remain opaque. They seem to be mere holomorphic functions (by 
definition!) and oscillatory, of course; but there is not yet a way of writing down their 
power series directly. 

Thus, the open problem is to develop a technical framework that will identify the dis- 
crete eigenvalue spectrum of a transfer operator, and to directly provide expressions for the 
holomorphic functions which are their eigenvectors. Such a framework would be of broad 
utility. 
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